[v10] Add character count `countFunction` option by colinrotherham · Pull Request #1897 · nhsuk/nhsuk-frontend

colinrotherham · 2026-04-20T11:42:00Z

Description

This PR adds a new countFunction option to character counts to cater for server-side differences in:

- Line length differences due to \n versus \r\n
- Word counts that vary based on empty spaces and punctuation
- Trimming empty space before counting
- Native multi-byte string support

For example, services might already count multi-byte strings server-side (e.g. len() in Python), resulting in client-side count mismatches, yet upcoming support for the grapheme countType is blocked by a third-party library integration

Support for a custom countFunction means teams can close this support gap
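
As a rough illustration of the gaps listed above, the sketch below shows the kind of count functions a service might write to mirror its server-side rules. The function names are hypothetical, and the single-argument `(text) => number` shape is an assumption based on the interface discussed in review, not a confirmed API.

```javascript
// Hypothetical count functions; the names and the (text) => number shape
// are assumptions made for illustration only

// Count Unicode code points, which is what len() returns for a Python 3 str
function countCodePoints(text) {
  return Array.from(text).length
}

// Count every line break as two characters, as if submitted with \r\n
function countWithCrlf(text) {
  return text.replace(/\r?\n/g, '\r\n').length
}

// Ignore leading and trailing whitespace before counting
function countTrimmed(text) {
  return text.trim().length
}

console.log('👍'.length) // 2 (UTF-16 code units)
console.log(countCodePoints('👍')) // 1
console.log(countWithCrlf('a\nb')) // 4
console.log(countTrimmed('  hello \n')) // 5
```

How such a function is wired into the component depends on the final option shape, so that step is left out here.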

Checklist

Tested against our testing policy (Resolution, Browser & Accessibility)
Follows our coding standards and style guide
CHANGELOG entry

MatMoore · 2026-04-20T14:42:29Z

+   * @satisfies {Record<string, (text: string, segmenter?: Intl.Segmenter | null) => number>}
+   */
+  static countFunctions = Object.freeze({
+    characters(text, segmenter) {


Some thoughts on the interface here...

Currently there are two things that influence the choice of algorithm: the selection of countFunction, and whether segmenter is set or not. Wondering if it would be simpler to have explicit functions for graphemes as well as characters and words? In that case passing countType: "graphemes" would be syntactic sugar for passing countFunction: countFunctions.graphemes and the same goes for "characters" and "words".

It also seems slightly inconsistent that if you pass countType: "words" with a custom function, we pass through a segmenter that can do word segmentation, but if you don't pass the custom function, it doesn't use the segmenter.

Maybe if the interface is simplified to fn(text) the segmenter could be an internal implementation detail?
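
One way to picture the suggestion above is a lookup table where each countType names a built-in function and a custom countFunction simply takes precedence. This is only a sketch: `resolveCountFunction`, the option handling and the built-in implementations are assumptions, not the component's code. The segmenter is created once and never appears in the `fn(text)` signature.

```javascript
// Created once and kept internal; null where Intl.Segmenter is unavailable
const graphemeSegmenter =
  typeof Intl !== 'undefined' && 'Segmenter' in Intl
    ? new Intl.Segmenter(undefined, { granularity: 'grapheme' })
    : null

// Assumed built-ins, all sharing the same fn(text) => number interface
const countFunctions = Object.freeze({
  characters: (text) => text.length,
  words: (text) => (text.match(/\S+/g) ?? []).length,
  graphemes: (text) =>
    graphemeSegmenter
      ? Array.from(graphemeSegmenter.segment(text)).length
      : Array.from(text).length // fallback: code points, not true graphemes
})

// Hypothetical helper: countType is sugar for picking a built-in function
function resolveCountFunction({ countType = 'characters', countFunction } = {}) {
  return countFunction ?? countFunctions[countType]
}

const countWords = resolveCountFunction({ countType: 'words' })
console.log(countWords('one two three')) // 3

const custom = resolveCountFunction({ countFunction: (text) => text.trim().length })
console.log(custom('  hi  ')) // 2
```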

Yeah, I partly did this because Intl.Segmenter also supports granularity: "word" 😮

So instead of countType: "words" we could keep maxwords but set granularity: "word"

I'll see what feedback comes from the GOV.UK Design System team

> Maybe if the interface is simplified to fn(text) the segmenter could be an internal implementation detail?

Yeah, I'm glad you said. I reckon for now let's ditch the static property and keep it all internal, especially with a potential performance cost to creating a new segmenter on every keystroke (hence injecting it in)

Feedback applied, thanks @MatMoore

Regarding word counts via the segmenter, I've messaged @romaricpascal and added a comment as the Unicode Default Word Boundary Specification varies quite significantly from /\S+/g

For example, consider this phrase:

My mother-in-law—Wait, what?

It matches only 3 words currently:

["My", "mother-in-law—Wait,", "what?"]

Yet using the segmenter with granularity: "word" it matches 6 words:

["My", "mother", "in", "law", "Wait", "what"]
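
The comparison above can be reproduced with a short script. It assumes a runtime with Intl.Segmenter support and uses the "en" locale, which is an assumption; the PR does not state a locale.

```javascript
// \u2014 is the em dash used in the example phrase
const phrase = 'My mother-in-law\u2014Wait, what?'

// Current approach: runs of non-whitespace
const regexWords = phrase.match(/\S+/g) ?? []

// Unicode word boundaries, keeping only word-like segments
// (this drops spaces and punctuation)
const segmenter = new Intl.Segmenter('en', { granularity: 'word' })
const segmenterWords = Array.from(segmenter.segment(phrase))
  .filter((segment) => segment.isWordLike)
  .map((segment) => segment.segment)

console.log(regexWords.length, regexWords)
console.log(segmenterWords.length, segmenterWords)
```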

> Yeah, I'm glad you said. I reckon for now let's ditch the static property and keep it all internal, especially with a potential performance cost to creating a new segmenter on every keystroke (hence injecting it in)

Yeah I figured we wouldn't want to instantiate one each call. The way you've done it is what I was imagining.
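
A minimal sketch of that idea, assuming a hypothetical `GraphemeCounter` class rather than the component itself: the segmenter is constructed on first use with `??=` and reused for every later count, so typing does not create a new instance per keystroke.

```javascript
// Hypothetical class for illustration; not the component's implementation
class GraphemeCounter {
  segmenter = null
  created = 0 // only here to show how often a segmenter is constructed

  count(text) {
    // Construct on first use, then reuse for every later call
    this.segmenter ??= this.createSegmenter()
    return Array.from(this.segmenter.segment(text)).length
  }

  createSegmenter() {
    this.created += 1
    return new Intl.Segmenter(undefined, { granularity: 'grapheme' })
  }
}

const counter = new GraphemeCounter()
console.log(counter.count('abc')) // 3
console.log(counter.count('e\u0301')) // 1 ("e" plus a combining acute accent)
console.log(counter.created) // 1
```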

That Unicode word boundary spec seems like a cleaner way to do word counting if we were implementing from scratch, but I guess it's a potentially breaking change, so maybe it's not worth it if we're keeping maxwords. Are we going to wait for the govuk implementation before we merge this?

Thanks. Yeah it'll be discussed at their next dev catch up

What we know is that they've proposed for maxwords to be deprecated

We could take the countType option as an opt-in to use segmenter. But, as this comment mentions, that might mean upgrading /\S+/g to a better word boundary regex for older browsers that lack support

They've not had a chance to discuss anything character count related other than this comment:

Add character count Intl.Segmenter support with customisable count function alphagov/govuk-frontend#6995 (review)

But that applies to #1899 primarily

For this PR all count functions now use the same interface so it's ready for another review

sonarqubecloud · 2026-05-11T13:54:45Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
95.2% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

colinrotherham · 2026-05-12T10:15:16Z

@anandamaryon1 I'll be adding a changelog entry into #1899 to wrap up this stack of work:

With this initial PR used to uplift some more recent GOV.UK Frontend changes:

[v10] Refactor character count method to reduce repeated updates #1892

Plus this PR to allow components to configure functions within their options:

[v10] Add support for functions in component configs #1896

We've contributed our improvements back to GOV.UK Frontend via:

Add character count Intl.Segmenter support with customisable count function alphagov/govuk-frontend#6995

colinrotherham added the Enhancement: feature request and character count labels Apr 20, 2026

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 20, 2026 11:42 Inactive

colinrotherham changed the title ~~[v10] Add character count support for countFunction option~~ [v10] Add character count countFunction option Apr 20, 2026

colinrotherham mentioned this pull request Apr 20, 2026

Character count's character/word count functions should be customisable alphagov/govuk-frontend#1364

Open

colinrotherham force-pushed the component-config-functions branch from 5562f44 to c3c6fc7 Compare April 20, 2026 12:42

colinrotherham force-pushed the character-count-custom-function branch from 272d552 to 551bf35 Compare April 20, 2026 12:42

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 20, 2026 12:43 Inactive

MatMoore reviewed Apr 20, 2026

View reviewed changes

colinrotherham force-pushed the character-count-custom-function branch from 551bf35 to 921cd37 Compare April 21, 2026 07:36

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 21, 2026 07:37 Inactive

colinrotherham force-pushed the character-count-custom-function branch from 921cd37 to 6b80700 Compare April 21, 2026 07:40

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 21, 2026 07:40 Inactive

colinrotherham added this to Service Manual Sprint Board Apr 21, 2026

colinrotherham moved this to Needs review in Service Manual Sprint Board Apr 21, 2026

colinrotherham force-pushed the character-count-custom-function branch from 6b80700 to 07676d2 Compare April 22, 2026 13:49

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 22, 2026 13:49 Inactive

colinrotherham force-pushed the component-config-functions branch from c3c6fc7 to e1a7744 Compare April 22, 2026 13:53

colinrotherham force-pushed the character-count-custom-function branch from 07676d2 to 649b198 Compare April 22, 2026 13:54

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 22, 2026 13:54 Inactive

colinrotherham force-pushed the component-config-functions branch from e1a7744 to 9ac9e9b Compare April 27, 2026 08:37

colinrotherham force-pushed the character-count-custom-function branch from 649b198 to e76c2e7 Compare April 27, 2026 08:37

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 27, 2026 08:38 Inactive

colinrotherham force-pushed the component-config-functions branch from 9ac9e9b to 93515cf Compare April 28, 2026 11:30

colinrotherham force-pushed the character-count-custom-function branch from e76c2e7 to 6c80311 Compare April 28, 2026 11:32

colinrotherham had a problem deploying to nhsuk-frontend-pr-1897 April 28, 2026 11:32 Failure

colinrotherham force-pushed the character-count-custom-function branch from 6c80311 to 3290d27 Compare April 28, 2026 11:47

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 28, 2026 11:47 Inactive

colinrotherham force-pushed the component-config-functions branch from 93515cf to 906201e Compare April 28, 2026 12:01

colinrotherham changed the base branch from component-config-functions to character-count-grapheme April 28, 2026 12:05

colinrotherham force-pushed the character-count-custom-function branch from 3290d27 to 9492e12 Compare April 28, 2026 12:06

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 28, 2026 12:06 Inactive

colinrotherham force-pushed the character-count-grapheme branch 2 times, most recently from eb6e5ba to 4365f61 Compare April 28, 2026 14:13

colinrotherham force-pushed the character-count-custom-function branch from 9492e12 to e93b9da Compare April 28, 2026 14:15

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 April 28, 2026 14:15 Inactive

colinrotherham linked an issue Apr 28, 2026 that may be closed by this pull request

Character count component counts code points, not characters #1619

Open

colinrotherham mentioned this pull request Apr 28, 2026

Add character count Intl.Segmenter support with customisable count function alphagov/govuk-frontend#6995

Open

colinrotherham force-pushed the character-count-grapheme branch 3 times, most recently from c7b7bbe to 1da2682 Compare May 8, 2026 16:18

Base automatically changed from character-count-grapheme to support/10.x May 8, 2026 16:25

colinrotherham added this to the v10.5.0 milestone May 8, 2026

Configure ESLint to allow ES2021 ??= logical assignment

248047d

colinrotherham force-pushed the character-count-custom-function branch from e93b9da to 078f07a Compare May 11, 2026 09:47

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 May 11, 2026 09:48 Inactive

colinrotherham added 2 commits May 11, 2026 14:48

Expose character count countFunctions

db1eda4

Add character count support for countFunction option

7b1a233

colinrotherham force-pushed the character-count-custom-function branch from 078f07a to 7b1a233 Compare May 11, 2026 13:49

colinrotherham temporarily deployed to nhsuk-frontend-pr-1897 May 11, 2026 13:49 Inactive

Smartin684 assigned anandamaryon1 May 11, 2026

MatMoore approved these changes May 12, 2026

View reviewed changes

colinrotherham merged commit 0533db3 into support/10.x May 12, 2026
24 checks passed

colinrotherham deleted the character-count-custom-function branch May 12, 2026 10:15

github-project-automation Bot moved this from Needs review to Ready to release in Service Manual Sprint Board May 12, 2026

colinrotherham merged 3 commits into support/10.x from character-count-custom-function


3 participants
